Dataset statistics
| Number of variables | 15 |
|---|---|
| Number of observations | 114984 |
| Missing cells | 4 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 19.5 MiB |
| Average record size in memory | 178.0 B |
Variable types
| NUM | 13 |
|---|---|
| CAT | 1 |
| BOOL | 1 |
product_category_name_english has a high cardinality: 71 distinct values | High cardinality |
df_index has unique values | Unique |
Reproduction
| Analysis started | 2020-09-12 15:17:42.410387 |
|---|---|
| Analysis finished | 2020-09-12 15:19:09.838029 |
| Duration | 1 minute and 27.43 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 114984 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 58212.20914 |
|---|---|
| Minimum | 0 |
| Maximum | 116580 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 898.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5815.15 |
| Q1 | 29083.75 |
| median | 58178.5 |
| Q3 | 87345.25 |
| 95-th percentile | 110678.85 |
| Maximum | 116580 |
| Range | 116580 |
| Interquartile range (IQR) | 58261.5 |
Descriptive statistics
| Standard deviation | 33635.58232 |
|---|---|
| Coefficient of variation (CV) | 0.5778097554 |
| Kurtosis | -1.199864531 |
| Mean | 58212.20914 |
| Median Absolute Deviation (MAD) | 29132.5 |
| Skewness | 0.001877420489 |
| Sum | 6693472656 |
| Variance | 1131352398 |
| Monotocity | Strictly increasing |
| Value | Count | Frequency (%) | |
| 2047 | 1 | < 0.1% | |
| 114205 | 1 | < 0.1% | |
| 36155 | 1 | < 0.1% | |
| 46396 | 1 | < 0.1% | |
| 48445 | 1 | < 0.1% | |
| 42302 | 1 | < 0.1% | |
| 44351 | 1 | < 0.1% | |
| 87424 | 1 | < 0.1% | |
| 89473 | 1 | < 0.1% | |
| 83330 | 1 | < 0.1% | |
| 85379 | 1 | < 0.1% | |
| 95620 | 1 | < 0.1% | |
| 97669 | 1 | < 0.1% | |
| 91526 | 1 | < 0.1% | |
| 93575 | 1 | < 0.1% | |
| 71048 | 1 | < 0.1% | |
| 73097 | 1 | < 0.1% | |
| 66954 | 1 | < 0.1% | |
| 69003 | 1 | < 0.1% | |
| 79244 | 1 | < 0.1% | |
| 81293 | 1 | < 0.1% | |
| 75150 | 1 | < 0.1% | |
| 77199 | 1 | < 0.1% | |
| 116114 | 1 | < 0.1% | |
| 103832 | 1 | < 0.1% | |
| Other values (114959) | 114959 | > 99.9% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% | |
| 6 | 1 | < 0.1% | |
| 7 | 1 | < 0.1% | |
| 8 | 1 | < 0.1% | |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 116580 | 1 | < 0.1% | |
| 116579 | 1 | < 0.1% | |
| 116578 | 1 | < 0.1% | |
| 116577 | 1 | < 0.1% | |
| 116576 | 1 | < 0.1% | |
| 116575 | 1 | < 0.1% | |
| 116574 | 1 | < 0.1% | |
| 116573 | 1 | < 0.1% | |
| 116572 | 1 | < 0.1% | |
| 116571 | 1 | < 0.1% |
price
Real number (ℝ≥0)
| Distinct | 5844 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 120.4722953 |
|---|---|
| Minimum | 0.85 |
| Maximum | 6735 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 898.4 KiB |
Quantile statistics
| Minimum | 0.85 |
|---|---|
| 5-th percentile | 17 |
| Q1 | 39.9 |
| median | 74.9 |
| Q3 | 134 |
| 95-th percentile | 349.9 |
| Maximum | 6735 |
| Range | 6734.15 |
| Interquartile range (IQR) | 94.1 |
Descriptive statistics
| Standard deviation | 183.8195953 |
|---|---|
| Coefficient of variation (CV) | 1.525824629 |
| Kurtosis | 120.288931 |
| Mean | 120.4722953 |
| Median Absolute Deviation (MAD) | 41.91 |
| Skewness | 7.91372638 |
| Sum | 13852386.4 |
| Variance | 33789.6436 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 59.9 | 2572 | 2.2% | |
| 69.9 | 2077 | 1.8% | |
| 49.9 | 2008 | 1.7% | |
| 89.9 | 1606 | 1.4% | |
| 99.9 | 1497 | 1.3% | |
| 29.9 | 1358 | 1.2% | |
| 39.9 | 1323 | 1.2% | |
| 19.9 | 1272 | 1.1% | |
| 79.9 | 1259 | 1.1% | |
| 29.99 | 1209 | 1.1% | |
| 49 | 1191 | 1.0% | |
| 99 | 998 | 0.9% | |
| 149.9 | 893 | 0.8% | |
| 109.9 | 816 | 0.7% | |
| 119.9 | 780 | 0.7% | |
| 99.99 | 737 | 0.6% | |
| 24.9 | 704 | 0.6% | |
| 39.99 | 692 | 0.6% | |
| 35 | 689 | 0.6% | |
| 49.99 | 678 | 0.6% | |
| 34.9 | 661 | 0.6% | |
| 89.99 | 658 | 0.6% | |
| 79 | 650 | 0.6% | |
| 129.9 | 648 | 0.6% | |
| 56.99 | 640 | 0.6% | |
| Other values (5819) | 87368 | 76.0% |
| Value | Count | Frequency (%) | |
| 0.85 | 3 | < 0.1% | |
| 1.2 | 20 | < 0.1% | |
| 2.2 | 2 | < 0.1% | |
| 2.29 | 1 | < 0.1% | |
| 2.9 | 1 | < 0.1% | |
| 2.99 | 1 | < 0.1% | |
| 3.06 | 3 | < 0.1% | |
| 3.49 | 3 | < 0.1% | |
| 3.5 | 6 | < 0.1% | |
| 3.54 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 6735 | 1 | < 0.1% | |
| 6729 | 1 | < 0.1% | |
| 6499 | 1 | < 0.1% | |
| 4799 | 1 | < 0.1% | |
| 4690 | 1 | < 0.1% | |
| 4590 | 1 | < 0.1% | |
| 4399.87 | 1 | < 0.1% | |
| 4099.99 | 1 | < 0.1% | |
| 4059 | 1 | < 0.1% | |
| 3999.9 | 1 | < 0.1% |
freight_value
Real number (ℝ≥0)
| Distinct | 6928 |
|---|---|
| Distinct (%) | 6.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20.01331864 |
|---|---|
| Minimum | 0 |
| Maximum | 409.68 |
| Zeros | 386 |
| Zeros (%) | 0.3% |
| Memory size | 898.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 7.78 |
| Q1 | 13.08 |
| median | 16.31 |
| Q3 | 21.19 |
| 95-th percentile | 45.2 |
| Maximum | 409.68 |
| Range | 409.68 |
| Interquartile range (IQR) | 8.11 |
Descriptive statistics
| Standard deviation | 15.75213232 |
|---|---|
| Coefficient of variation (CV) | 0.7870824725 |
| Kurtosis | 58.32163924 |
| Mean | 20.01331864 |
| Median Absolute Deviation (MAD) | 3.62 |
| Skewness | 5.551580213 |
| Sum | 2301211.43 |
| Variance | 248.1296725 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 15.1 | 3742 | 3.3% | |
| 7.78 | 2282 | 2.0% | |
| 11.85 | 1942 | 1.7% | |
| 14.1 | 1911 | 1.7% | |
| 18.23 | 1590 | 1.4% | |
| 7.39 | 1545 | 1.3% | |
| 16.11 | 1186 | 1.0% | |
| 15.23 | 1038 | 0.9% | |
| 8.72 | 931 | 0.8% | |
| 16.79 | 899 | 0.8% | |
| 14.52 | 848 | 0.7% | |
| 12.79 | 814 | 0.7% | |
| 10.96 | 714 | 0.6% | |
| 9.34 | 687 | 0.6% | |
| 17.6 | 630 | 0.5% | |
| 12.69 | 618 | 0.5% | |
| 17.67 | 605 | 0.5% | |
| 15.11 | 455 | 0.4% | |
| 11.73 | 447 | 0.4% | |
| 12.48 | 438 | 0.4% | |
| 13.37 | 423 | 0.4% | |
| 17.63 | 417 | 0.4% | |
| 8.88 | 413 | 0.4% | |
| 15.79 | 409 | 0.4% | |
| 19.32 | 404 | 0.4% | |
| Other values (6903) | 89596 | 77.9% |
| Value | Count | Frequency (%) | |
| 0 | 386 | 0.3% | |
| 0.01 | 4 | < 0.1% | |
| 0.02 | 3 | < 0.1% | |
| 0.03 | 14 | < 0.1% | |
| 0.04 | 4 | < 0.1% | |
| 0.05 | 3 | < 0.1% | |
| 0.06 | 13 | < 0.1% | |
| 0.07 | 1 | < 0.1% | |
| 0.08 | 12 | < 0.1% | |
| 0.09 | 6 | < 0.1% |
| Value | Count | Frequency (%) | |
| 409.68 | 1 | < 0.1% | |
| 375.28 | 2 | < 0.1% | |
| 339.59 | 1 | < 0.1% | |
| 338.3 | 1 | < 0.1% | |
| 322.1 | 1 | < 0.1% | |
| 321.88 | 1 | < 0.1% | |
| 321.46 | 1 | < 0.1% | |
| 317.47 | 1 | < 0.1% | |
| 314.4 | 1 | < 0.1% | |
| 314.02 | 1 | < 0.1% |
review_score
Real number (ℝ≥0)
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.047937104 |
|---|---|
| Minimum | 1 |
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 898.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 5 |
| Q3 | 5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.374089006 |
|---|---|
| Coefficient of variation (CV) | 0.3394541395 |
| Kurtosis | 0.286038152 |
| Mean | 4.047937104 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -1.295852661 |
| Sum | 465448 |
| Variance | 1.888120598 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 5 | 65330 | 56.8% | |
| 4 | 21913 | 19.1% | |
| 1 | 14032 | 12.2% | |
| 3 | 9696 | 8.4% | |
| 2 | 4013 | 3.5% |
| Value | Count | Frequency (%) | |
| 1 | 14032 | 12.2% | |
| 2 | 4013 | 3.5% | |
| 3 | 9696 | 8.4% | |
| 4 | 21913 | 19.1% | |
| 5 | 65330 | 56.8% |
| Value | Count | Frequency (%) | |
| 5 | 65330 | 56.8% | |
| 4 | 21913 | 19.1% | |
| 3 | 9696 | 8.4% | |
| 2 | 4013 | 3.5% | |
| 1 | 14032 | 12.2% |
product_photos_qty
Real number (ℝ≥0)
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.205158979 |
|---|---|
| Minimum | 1 |
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 898.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 6 |
| Maximum | 20 |
| Range | 19 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.717360052 |
|---|---|
| Coefficient of variation (CV) | 0.7787919461 |
| Kurtosis | 4.820190098 |
| Mean | 2.205158979 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.908646709 |
| Sum | 253558 |
| Variance | 2.94932555 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1 | 58135 | 50.6% | |
| 2 | 22736 | 19.8% | |
| 3 | 12810 | 11.1% | |
| 4 | 8729 | 7.6% | |
| 5 | 5520 | 4.8% | |
| 6 | 3895 | 3.4% | |
| 7 | 1536 | 1.3% | |
| 8 | 765 | 0.7% | |
| 10 | 347 | 0.3% | |
| 9 | 314 | 0.3% | |
| 11 | 73 | 0.1% | |
| 12 | 59 | 0.1% | |
| 13 | 30 | < 0.1% | |
| 17 | 11 | < 0.1% | |
| 15 | 11 | < 0.1% | |
| 14 | 6 | < 0.1% | |
| 18 | 4 | < 0.1% | |
| 19 | 2 | < 0.1% | |
| 20 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 58135 | 50.6% | |
| 2 | 22736 | 19.8% | |
| 3 | 12810 | 11.1% | |
| 4 | 8729 | 7.6% | |
| 5 | 5520 | 4.8% | |
| 6 | 3895 | 3.4% | |
| 7 | 1536 | 1.3% | |
| 8 | 765 | 0.7% | |
| 9 | 314 | 0.3% | |
| 10 | 347 | 0.3% |
| Value | Count | Frequency (%) | |
| 20 | 1 | < 0.1% | |
| 19 | 2 | < 0.1% | |
| 18 | 4 | < 0.1% | |
| 17 | 11 | < 0.1% | |
| 15 | 11 | < 0.1% | |
| 14 | 6 | < 0.1% | |
| 13 | 30 | < 0.1% | |
| 12 | 59 | 0.1% | |
| 11 | 73 | 0.1% | |
| 10 | 347 | 0.3% |
product_weight_g
Real number (ℝ≥0)
| Distinct | 2188 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2109.384222 |
|---|---|
| Minimum | 0 |
| Maximum | 40425 |
| Zeros | 8 |
| Zeros (%) | < 0.1% |
| Memory size | 898.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 125 |
| Q1 | 300 |
| median | 700 |
| Q3 | 1800 |
| 95-th percentile | 9800 |
| Maximum | 40425 |
| Range | 40425 |
| Interquartile range (IQR) | 1500 |
Descriptive statistics
| Standard deviation | 3773.414578 |
|---|---|
| Coefficient of variation (CV) | 1.788870201 |
| Kurtosis | 16.14513512 |
| Mean | 2109.384222 |
| Median Absolute Deviation (MAD) | 500 |
| Skewness | 3.590179266 |
| Sum | 242543326 |
| Variance | 14238657.57 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 200 | 6803 | 5.9% | |
| 150 | 5316 | 4.6% | |
| 250 | 4647 | 4.0% | |
| 300 | 4281 | 3.7% | |
| 100 | 3532 | 3.1% | |
| 400 | 3461 | 3.0% | |
| 350 | 3201 | 2.8% | |
| 600 | 2770 | 2.4% | |
| 500 | 2758 | 2.4% | |
| 700 | 2099 | 1.8% | |
| 800 | 1858 | 1.6% | |
| 450 | 1795 | 1.6% | |
| 550 | 1684 | 1.5% | |
| 900 | 1477 | 1.3% | |
| 1000 | 1394 | 1.2% | |
| 1500 | 1368 | 1.2% | |
| 1200 | 1295 | 1.1% | |
| 850 | 1292 | 1.1% | |
| 650 | 1258 | 1.1% | |
| 1400 | 1148 | 1.0% | |
| 750 | 1109 | 1.0% | |
| 950 | 1107 | 1.0% | |
| 1100 | 1074 | 0.9% | |
| 1550 | 1054 | 0.9% | |
| 1050 | 999 | 0.9% | |
| Other values (2163) | 56203 | 48.9% |
| Value | Count | Frequency (%) | |
| 0 | 8 | < 0.1% | |
| 2 | 5 | < 0.1% | |
| 25 | 3 | < 0.1% | |
| 50 | 954 | 0.8% | |
| 53 | 2 | < 0.1% | |
| 54 | 2 | < 0.1% | |
| 55 | 1 | < 0.1% | |
| 58 | 1 | < 0.1% | |
| 60 | 7 | < 0.1% | |
| 61 | 5 | < 0.1% |
| Value | Count | Frequency (%) | |
| 40425 | 3 | < 0.1% | |
| 30000 | 296 | 0.3% | |
| 29800 | 1 | < 0.1% | |
| 29750 | 1 | < 0.1% | |
| 29700 | 3 | < 0.1% | |
| 29600 | 5 | < 0.1% | |
| 29500 | 2 | < 0.1% | |
| 29250 | 1 | < 0.1% | |
| 29150 | 1 | < 0.1% | |
| 29100 | 1 | < 0.1% |
product_length_cm
Real number (ℝ≥0)
| Distinct | 99 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.29019072 |
|---|---|
| Minimum | 7 |
| Maximum | 105 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 898.4 KiB |
Quantile statistics
| Minimum | 7 |
|---|---|
| 5-th percentile | 16 |
| Q1 | 18 |
| median | 25 |
| Q3 | 38 |
| 95-th percentile | 62 |
| Maximum | 105 |
| Range | 98 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 16.17109963 |
|---|---|
| Coefficient of variation (CV) | 0.5338724929 |
| Kurtosis | 3.64605155 |
| Mean | 30.29019072 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 1.737664285 |
| Sum | 3482857 |
| Variance | 261.5044634 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 16 | 17642 | 15.3% | |
| 20 | 10544 | 9.2% | |
| 30 | 7786 | 6.8% | |
| 17 | 6092 | 5.3% | |
| 18 | 5759 | 5.0% | |
| 19 | 4816 | 4.2% | |
| 25 | 4746 | 4.1% | |
| 40 | 4220 | 3.7% | |
| 22 | 3914 | 3.4% | |
| 50 | 3067 | 2.7% | |
| 35 | 2986 | 2.6% | |
| 21 | 2436 | 2.1% | |
| 45 | 2427 | 2.1% | |
| 23 | 2336 | 2.0% | |
| 26 | 1876 | 1.6% | |
| 28 | 1774 | 1.5% | |
| 42 | 1745 | 1.5% | |
| 24 | 1710 | 1.5% | |
| 60 | 1701 | 1.5% | |
| 27 | 1499 | 1.3% | |
| 33 | 1472 | 1.3% | |
| 36 | 1454 | 1.3% | |
| 32 | 1351 | 1.2% | |
| 37 | 1333 | 1.2% | |
| 34 | 1327 | 1.2% | |
| Other values (74) | 18970 | 16.5% |
| Value | Count | Frequency (%) | |
| 7 | 32 | < 0.1% | |
| 8 | 2 | < 0.1% | |
| 9 | 4 | < 0.1% | |
| 10 | 8 | < 0.1% | |
| 11 | 96 | 0.1% | |
| 12 | 41 | < 0.1% | |
| 13 | 58 | 0.1% | |
| 14 | 137 | 0.1% | |
| 15 | 215 | 0.2% | |
| 16 | 17642 | 15.3% |
| Value | Count | Frequency (%) | |
| 105 | 322 | 0.3% | |
| 104 | 35 | < 0.1% | |
| 103 | 45 | < 0.1% | |
| 102 | 60 | 0.1% | |
| 101 | 108 | 0.1% | |
| 100 | 394 | 0.3% | |
| 99 | 36 | < 0.1% | |
| 98 | 49 | < 0.1% | |
| 97 | 11 | < 0.1% | |
| 96 | 8 | < 0.1% |
product_height_cm
Real number (ℝ≥0)
| Distinct | 102 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.62439665 |
|---|---|
| Minimum | 2 |
| Maximum | 105 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 898.4 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 8 |
| median | 13 |
| Q3 | 20 |
| 95-th percentile | 45 |
| Maximum | 105 |
| Range | 103 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 13.45599126 |
|---|---|
| Coefficient of variation (CV) | 0.8094123079 |
| Kurtosis | 7.295780169 |
| Mean | 16.62439665 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 2.243686579 |
| Sum | 1911523 |
| Variance | 181.0637008 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 10 | 10154 | 8.8% | |
| 20 | 6754 | 5.9% | |
| 15 | 6741 | 5.9% | |
| 11 | 6259 | 5.4% | |
| 12 | 6186 | 5.4% | |
| 2 | 5067 | 4.4% | |
| 4 | 4750 | 4.1% | |
| 8 | 4749 | 4.1% | |
| 16 | 4653 | 4.0% | |
| 5 | 4610 | 4.0% | |
| 7 | 4195 | 3.6% | |
| 13 | 4050 | 3.5% | |
| 14 | 3633 | 3.2% | |
| 30 | 3570 | 3.1% | |
| 6 | 3492 | 3.0% | |
| 9 | 3361 | 2.9% | |
| 25 | 3325 | 2.9% | |
| 22 | 3194 | 2.8% | |
| 3 | 2765 | 2.4% | |
| 18 | 2352 | 2.0% | |
| 17 | 1870 | 1.6% | |
| 35 | 1723 | 1.5% | |
| 19 | 1518 | 1.3% | |
| 21 | 1303 | 1.1% | |
| 40 | 1077 | 0.9% | |
| Other values (77) | 13632 | 11.9% |
| Value | Count | Frequency (%) | |
| 2 | 5067 | 4.4% | |
| 3 | 2765 | 2.4% | |
| 4 | 4750 | 4.1% | |
| 5 | 4610 | 4.0% | |
| 6 | 3492 | 3.0% | |
| 7 | 4195 | 3.6% | |
| 8 | 4749 | 4.1% | |
| 9 | 3361 | 2.9% | |
| 10 | 10154 | 8.8% | |
| 11 | 6259 | 5.4% |
| Value | Count | Frequency (%) | |
| 105 | 138 | 0.1% | |
| 104 | 12 | < 0.1% | |
| 103 | 49 | < 0.1% | |
| 102 | 10 | < 0.1% | |
| 100 | 41 | < 0.1% | |
| 99 | 5 | < 0.1% | |
| 98 | 3 | < 0.1% | |
| 97 | 2 | < 0.1% | |
| 96 | 7 | < 0.1% | |
| 95 | 22 | < 0.1% |
product_width_cm
Real number (ℝ≥0)
| Distinct | 94 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.10772897 |
|---|---|
| Minimum | 6 |
| Maximum | 118 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 898.4 KiB |
Quantile statistics
| Minimum | 6 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 15 |
| median | 20 |
| Q3 | 30 |
| 95-th percentile | 45 |
| Maximum | 118 |
| Range | 112 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 11.74928356 |
|---|---|
| Coefficient of variation (CV) | 0.5084568707 |
| Kurtosis | 4.571625601 |
| Mean | 23.10772897 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 1.707890227 |
| Sum | 2656996 |
| Variance | 138.0456642 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 20 | 12373 | 10.8% | |
| 11 | 10608 | 9.2% | |
| 15 | 8914 | 7.8% | |
| 16 | 8629 | 7.5% | |
| 30 | 7844 | 6.8% | |
| 12 | 5555 | 4.8% | |
| 13 | 5402 | 4.7% | |
| 14 | 4732 | 4.1% | |
| 18 | 4108 | 3.6% | |
| 40 | 4042 | 3.5% | |
| 25 | 3911 | 3.4% | |
| 17 | 3626 | 3.2% | |
| 35 | 3307 | 2.9% | |
| 22 | 2541 | 2.2% | |
| 19 | 2472 | 2.1% | |
| 21 | 1968 | 1.7% | |
| 23 | 1780 | 1.5% | |
| 28 | 1686 | 1.5% | |
| 26 | 1573 | 1.4% | |
| 29 | 1324 | 1.2% | |
| 32 | 1272 | 1.1% | |
| 27 | 1227 | 1.1% | |
| 50 | 1208 | 1.1% | |
| 36 | 1185 | 1.0% | |
| 33 | 1173 | 1.0% | |
| Other values (69) | 12523 | 10.9% |
| Value | Count | Frequency (%) | |
| 6 | 2 | < 0.1% | |
| 7 | 5 | < 0.1% | |
| 8 | 29 | < 0.1% | |
| 9 | 50 | < 0.1% | |
| 10 | 82 | 0.1% | |
| 11 | 10608 | 9.2% | |
| 12 | 5555 | 4.8% | |
| 13 | 5402 | 4.7% | |
| 14 | 4732 | 4.1% | |
| 15 | 8914 | 7.8% |
| Value | Count | Frequency (%) | |
| 118 | 8 | < 0.1% | |
| 105 | 14 | < 0.1% | |
| 104 | 1 | < 0.1% | |
| 102 | 2 | < 0.1% | |
| 101 | 2 | < 0.1% | |
| 100 | 43 | < 0.1% | |
| 98 | 1 | < 0.1% | |
| 97 | 1 | < 0.1% | |
| 95 | 2 | < 0.1% | |
| 93 | 17 | < 0.1% |
| Distinct | 71 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 898.4 KiB |
| bed_bath_table | |
|---|---|
| health_beauty | |
| sports_leisure | |
| furniture_decor | |
| computers_accessories | |
| Other values (66) |
| Value | Count | Frequency (%) | |
| bed_bath_table | 11851 | 10.3% | |
| health_beauty | 9892 | 8.6% | |
| sports_leisure | 8876 | 7.7% | |
| furniture_decor | 8698 | 7.6% | |
| computers_accessories | 8048 | 7.0% | |
| housewares | 7270 | 6.3% | |
| watches_gifts | 6107 | 5.3% | |
| telephony | 4647 | 4.0% | |
| garden_tools | 4511 | 3.9% | |
| auto | 4340 | 3.8% | |
| toys | 4235 | 3.7% | |
| cool_stuff | 3941 | 3.4% | |
| perfumery | 3535 | 3.1% | |
| baby | 3156 | 2.7% | |
| electronics | 2824 | 2.5% | |
| stationery | 2595 | 2.3% | |
| fashion_bags_accessories | 2138 | 1.9% | |
| pet_shop | 2014 | 1.8% | |
| office_furniture | 1771 | 1.5% | |
| consoles_games | 1179 | 1.0% | |
| luggage_accessories | 1154 | 1.0% | |
| construction_tools_construction | 945 | 0.8% | |
| home_appliances | 816 | 0.7% | |
| musical_instruments | 708 | 0.6% | |
| small_appliances | 693 | 0.6% | |
| Other values (46) | 9040 | 7.9% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 39 |
|---|---|
| Median length | 13 |
| Mean length | 12.99052912 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 182788 | 12.2% | |
| s | 140147 | 9.4% | |
| t | 131583 | 8.8% | |
| o | 110416 | 7.4% | |
| r | 104346 | 7.0% | |
| a | 101068 | 6.8% | |
| _ | 100858 | 6.8% | |
| u | 77163 | 5.2% | |
| c | 71561 | 4.8% | |
| i | 62492 | 4.2% | |
| h | 58835 | 3.9% | |
| l | 58486 | 3.9% | |
| b | 55472 | 3.7% | |
| n | 48364 | 3.2% | |
| f | 37326 | 2.5% | |
| p | 34443 | 2.3% | |
| d | 30566 | 2.0% | |
| y | 29729 | 2.0% | |
| g | 20974 | 1.4% | |
| m | 20108 | 1.3% | |
| w | 13551 | 0.9% | |
| k | 2174 | 0.1% | |
| v | 689 | < 0.1% | |
| 2 | 298 | < 0.1% | |
| x | 266 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1392547 | 93.2% | |
| Connector Punctuation | 100858 | 6.8% | |
| Decimal Number | 298 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 182788 | 13.1% | |
| s | 140147 | 10.1% | |
| t | 131583 | 9.4% | |
| o | 110416 | 7.9% | |
| r | 104346 | 7.5% | |
| a | 101068 | 7.3% | |
| u | 77163 | 5.5% | |
| c | 71561 | 5.1% | |
| i | 62492 | 4.5% | |
| h | 58835 | 4.2% | |
| l | 58486 | 4.2% | |
| b | 55472 | 4.0% | |
| n | 48364 | 3.5% | |
| f | 37326 | 2.7% | |
| p | 34443 | 2.5% | |
| d | 30566 | 2.2% | |
| y | 29729 | 2.1% | |
| g | 20974 | 1.5% | |
| m | 20108 | 1.4% | |
| w | 13551 | 1.0% | |
| k | 2174 | 0.2% | |
| v | 689 | < 0.1% | |
| x | 266 | < 0.1% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 100858 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 298 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1392547 | 93.2% | |
| Common | 101156 | 6.8% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 182788 | 13.1% | |
| s | 140147 | 10.1% | |
| t | 131583 | 9.4% | |
| o | 110416 | 7.9% | |
| r | 104346 | 7.5% | |
| a | 101068 | 7.3% | |
| u | 77163 | 5.5% | |
| c | 71561 | 5.1% | |
| i | 62492 | 4.5% | |
| h | 58835 | 4.2% | |
| l | 58486 | 4.2% | |
| b | 55472 | 4.0% | |
| n | 48364 | 3.5% | |
| f | 37326 | 2.7% | |
| p | 34443 | 2.5% | |
| d | 30566 | 2.2% | |
| y | 29729 | 2.1% | |
| g | 20974 | 1.5% | |
| m | 20108 | 1.4% | |
| w | 13551 | 1.0% | |
| k | 2174 | 0.2% | |
| v | 689 | < 0.1% | |
| x | 266 | < 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| _ | 100858 | 99.7% | |
| 2 | 298 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1493703 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 182788 | 12.2% | |
| s | 140147 | 9.4% | |
| t | 131583 | 8.8% | |
| o | 110416 | 7.4% | |
| r | 104346 | 7.0% | |
| a | 101068 | 6.8% | |
| _ | 100858 | 6.8% | |
| u | 77163 | 5.2% | |
| c | 71561 | 4.8% | |
| i | 62492 | 4.2% | |
| h | 58835 | 3.9% | |
| l | 58486 | 3.9% | |
| b | 55472 | 3.7% | |
| n | 48364 | 3.2% | |
| f | 37326 | 2.5% | |
| p | 34443 | 2.3% | |
| d | 30566 | 2.0% | |
| y | 29729 | 2.0% | |
| g | 20974 | 1.4% | |
| m | 20108 | 1.3% | |
| w | 13551 | 0.9% | |
| k | 2174 | 0.1% | |
| v | 689 | < 0.1% | |
| 2 | 298 | < 0.1% | |
| x | 266 | < 0.1% |
payment_installments
Real number (ℝ≥0)
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.946844778 |
|---|---|
| Minimum | 0 |
| Maximum | 24 |
| Zeros | 3 |
| Zeros (%) | < 0.1% |
| Memory size | 898.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 10 |
| Maximum | 24 |
| Range | 24 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.781830288 |
|---|---|
| Coefficient of variation (CV) | 0.9440029919 |
| Kurtosis | 2.521086334 |
| Mean | 2.946844778 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.619280771 |
| Sum | 338840 |
| Variance | 7.738579749 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1 | 57269 | 49.8% | |
| 2 | 13332 | 11.6% | |
| 3 | 11499 | 10.0% | |
| 4 | 7816 | 6.8% | |
| 10 | 6761 | 5.9% | |
| 5 | 5916 | 5.1% | |
| 8 | 4965 | 4.3% | |
| 6 | 4518 | 3.9% | |
| 7 | 1766 | 1.5% | |
| 9 | 711 | 0.6% | |
| 12 | 164 | 0.1% | |
| 15 | 91 | 0.1% | |
| 18 | 38 | < 0.1% | |
| 24 | 34 | < 0.1% | |
| 11 | 25 | < 0.1% | |
| 20 | 20 | < 0.1% | |
| 13 | 19 | < 0.1% | |
| 14 | 15 | < 0.1% | |
| 16 | 7 | < 0.1% | |
| 17 | 7 | < 0.1% | |
| 21 | 6 | < 0.1% | |
| 0 | 3 | < 0.1% | |
| 23 | 1 | < 0.1% | |
| 22 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 3 | < 0.1% | |
| 1 | 57269 | 49.8% | |
| 2 | 13332 | 11.6% | |
| 3 | 11499 | 10.0% | |
| 4 | 7816 | 6.8% | |
| 5 | 5916 | 5.1% | |
| 6 | 4518 | 3.9% | |
| 7 | 1766 | 1.5% | |
| 8 | 4965 | 4.3% | |
| 9 | 711 | 0.6% |
| Value | Count | Frequency (%) | |
| 24 | 34 | < 0.1% | |
| 23 | 1 | < 0.1% | |
| 22 | 1 | < 0.1% | |
| 21 | 6 | < 0.1% | |
| 20 | 20 | < 0.1% | |
| 18 | 38 | < 0.1% | |
| 17 | 7 | < 0.1% | |
| 16 | 7 | < 0.1% | |
| 15 | 91 | 0.1% | |
| 14 | 15 | < 0.1% |
payment_value
Real number (ℝ≥0)
| Distinct | 28538 |
|---|---|
| Distinct (%) | 24.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 172.7658347 |
|---|---|
| Minimum | 0 |
| Maximum | 13664.08 |
| Zeros | 4 |
| Zeros (%) | < 0.1% |
| Memory size | 898.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 27.2815 |
| Q1 | 61 |
| median | 108.19 |
| Q3 | 189.5725 |
| 95-th percentile | 515.3055 |
| Maximum | 13664.08 |
| Range | 13664.08 |
| Interquartile range (IQR) | 128.5725 |
Descriptive statistics
| Standard deviation | 267.7545692 |
|---|---|
| Coefficient of variation (CV) | 1.549812031 |
| Kurtosis | 516.8024147 |
| Mean | 172.7658347 |
| Median Absolute Deviation (MAD) | 56.69 |
| Skewness | 14.25757917 |
| Sum | 19865306.74 |
| Variance | 71692.50934 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 50 | 338 | 0.3% | |
| 100 | 296 | 0.3% | |
| 20 | 281 | 0.2% | |
| 77.57 | 245 | 0.2% | |
| 35 | 162 | 0.1% | |
| 73.34 | 158 | 0.1% | |
| 30 | 132 | 0.1% | |
| 116.94 | 132 | 0.1% | |
| 56.78 | 119 | 0.1% | |
| 155.14 | 119 | 0.1% | |
| 107.78 | 118 | 0.1% | |
| 25 | 117 | 0.1% | |
| 65 | 113 | 0.1% | |
| 99.9 | 106 | 0.1% | |
| 86.15 | 105 | 0.1% | |
| 45 | 102 | 0.1% | |
| 87.64 | 102 | 0.1% | |
| 67.5 | 101 | 0.1% | |
| 105.28 | 98 | 0.1% | |
| 31.75 | 97 | 0.1% | |
| 64 | 96 | 0.1% | |
| 45.09 | 94 | 0.1% | |
| 37.77 | 93 | 0.1% | |
| 64.1 | 92 | 0.1% | |
| 65.71 | 90 | 0.1% | |
| Other values (28513) | 111478 | 97.0% |
| Value | Count | Frequency (%) | |
| 0 | 4 | < 0.1% | |
| 0.01 | 6 | < 0.1% | |
| 0.03 | 2 | < 0.1% | |
| 0.05 | 2 | < 0.1% | |
| 0.08 | 2 | < 0.1% | |
| 0.09 | 1 | < 0.1% | |
| 0.1 | 3 | < 0.1% | |
| 0.11 | 2 | < 0.1% | |
| 0.13 | 1 | < 0.1% | |
| 0.14 | 5 | < 0.1% |
| Value | Count | Frequency (%) | |
| 13664.08 | 8 | < 0.1% | |
| 7274.88 | 4 | < 0.1% | |
| 6929.31 | 1 | < 0.1% | |
| 6922.21 | 1 | < 0.1% | |
| 6726.66 | 1 | < 0.1% | |
| 6081.54 | 6 | < 0.1% | |
| 4950.34 | 1 | < 0.1% | |
| 4809.44 | 2 | < 0.1% | |
| 4764.34 | 1 | < 0.1% | |
| 4681.78 | 1 | < 0.1% |
encodedCategory
Real number (ℝ≥0)
| Distinct | 71 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.8608763 |
|---|---|
| Minimum | 0 |
| Maximum | 70 |
| Zeros | 247 |
| Zeros (%) | 0.2% |
| Memory size | 449.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 15 |
| median | 42 |
| Q3 | 60 |
| 95-th percentile | 70 |
| Maximum | 70 |
| Range | 70 |
| Interquartile range (IQR) | 45 |
Descriptive statistics
| Standard deviation | 22.51523877 |
|---|---|
| Coefficient of variation (CV) | 0.5793806244 |
| Kurtosis | -1.357800642 |
| Mean | 38.8608763 |
| Median Absolute Deviation (MAD) | 23 |
| Skewness | -0.1621643987 |
| Sum | 4468379 |
| Variance | 506.935977 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 7 | 11851 | 10.3% | |
| 43 | 9892 | 8.6% | |
| 65 | 8876 | 7.7% | |
| 39 | 8698 | 7.6% | |
| 15 | 8048 | 7.0% | |
| 49 | 7270 | 6.3% | |
| 70 | 6107 | 5.3% | |
| 68 | 4647 | 4.0% | |
| 42 | 4511 | 3.9% | |
| 5 | 4340 | 3.8% | |
| 69 | 4235 | 3.7% | |
| 20 | 3941 | 3.4% | |
| 59 | 3535 | 3.1% | |
| 6 | 3156 | 2.7% | |
| 26 | 2824 | 2.5% | |
| 66 | 2595 | 2.3% | |
| 28 | 2138 | 1.9% | |
| 60 | 2014 | 1.8% | |
| 57 | 1771 | 1.5% | |
| 16 | 1179 | 1.0% | |
| 53 | 1154 | 1.0% | |
| 17 | 945 | 0.8% | |
| 44 | 816 | 0.7% | |
| 56 | 708 | 0.6% | |
| 63 | 693 | 0.6% | |
| Other values (46) | 9040 | 7.9% |
| Value | Count | Frequency (%) | |
| 0 | 247 | 0.2% | |
| 1 | 297 | 0.3% | |
| 2 | 208 | 0.2% | |
| 3 | 24 | < 0.1% | |
| 4 | 380 | 0.3% | |
| 5 | 4340 | 3.8% | |
| 6 | 3156 | 2.7% | |
| 7 | 11851 | 10.3% | |
| 8 | 557 | 0.5% | |
| 9 | 61 | 0.1% |
| Value | Count | Frequency (%) | |
| 70 | 6107 | 5.3% | |
| 69 | 4235 | 3.7% | |
| 68 | 4647 | 4.0% | |
| 67 | 87 | 0.1% | |
| 66 | 2595 | 2.3% | |
| 65 | 8876 | 7.7% | |
| 64 | 75 | 0.1% | |
| 63 | 693 | 0.6% | |
| 62 | 199 | 0.2% | |
| 61 | 2 | < 0.1% |
TargetVar
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 898.4 KiB |
| 1 | |
|---|---|
| 0 | 553 |
| Value | Count | Frequency (%) | |
| 1 | 114431 | 99.5% | |
| 0 | 553 | 0.5% |
Days_to_deliver
Real number (ℝ≥0)
| Distinct | 93322 |
|---|---|
| Distinct (%) | 81.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.84195696 |
|---|---|
| Minimum | 2.008009259 |
| Maximum | 155.135463 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 898.4 KiB |
Quantile statistics
| Minimum | 2.008009259 |
|---|---|
| 5-th percentile | 10.52446817 |
| Q1 | 18.38960648 |
| median | 23.24926505 |
| Q3 | 28.4716985 |
| 95-th percentile | 38.62075231 |
| Maximum | 155.135463 |
| Range | 153.1274537 |
| Interquartile range (IQR) | 10.08209201 |
Descriptive statistics
| Standard deviation | 8.865754856 |
|---|---|
| Coefficient of variation (CV) | 0.3718551656 |
| Kurtosis | 4.949608203 |
| Mean | 23.84195696 |
| Median Absolute Deviation (MAD) | 5.056261574 |
| Skewness | 0.9901440265 |
| Sum | 2741443.58 |
| Variance | 78.60160916 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 21.14825231 | 63 | 0.1% | |
| 29.37725694 | 38 | < 0.1% | |
| 20.49641204 | 26 | < 0.1% | |
| 22.30940972 | 24 | < 0.1% | |
| 20.01428241 | 24 | < 0.1% | |
| 15.42038194 | 24 | < 0.1% | |
| 23.47988426 | 24 | < 0.1% | |
| 31.37797454 | 24 | < 0.1% | |
| 25.4999537 | 22 | < 0.1% | |
| 16.45383102 | 22 | < 0.1% | |
| 11.2365162 | 21 | < 0.1% | |
| 8.984513889 | 21 | < 0.1% | |
| 36.52185185 | 21 | < 0.1% | |
| 13.35369213 | 20 | < 0.1% | |
| 24.2334375 | 20 | < 0.1% | |
| 28.6093287 | 20 | < 0.1% | |
| 17.31607639 | 19 | < 0.1% | |
| 43.40006944 | 16 | < 0.1% | |
| 20.05296296 | 16 | < 0.1% | |
| 10.41211806 | 15 | < 0.1% | |
| 55.99005787 | 15 | < 0.1% | |
| 10.5509375 | 15 | < 0.1% | |
| 35.09387731 | 15 | < 0.1% | |
| 25.14523148 | 15 | < 0.1% | |
| 39.06083333 | 15 | < 0.1% | |
| Other values (93297) | 114429 | 99.5% |
| Value | Count | Frequency (%) | |
| 2.008009259 | 1 | < 0.1% | |
| 2.010451389 | 1 | < 0.1% | |
| 2.024074074 | 1 | < 0.1% | |
| 2.026018519 | 1 | < 0.1% | |
| 2.028287037 | 1 | < 0.1% | |
| 2.042326389 | 1 | < 0.1% | |
| 2.042986111 | 1 | < 0.1% | |
| 2.045752315 | 1 | < 0.1% | |
| 2.047291667 | 1 | < 0.1% | |
| 2.051412037 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 155.135463 | 1 | < 0.1% | |
| 149.5922801 | 2 | < 0.1% | |
| 146.2491319 | 1 | < 0.1% | |
| 144.8952431 | 1 | < 0.1% | |
| 140.0634722 | 2 | < 0.1% | |
| 116.0978588 | 1 | < 0.1% | |
| 109.3422917 | 2 | < 0.1% | |
| 106.992338 | 1 | < 0.1% | |
| 101.0100116 | 1 | < 0.1% | |
| 99.13206019 | 1 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| df_index | price | freight_value | review_score | product_photos_qty | product_weight_g | product_length_cm | product_height_cm | product_width_cm | product_category_name_english | payment_installments | payment_value | encodedCategory | TargetVar | Days_to_deliver | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 58.9 | 13.29 | 5 | 4.0 | 650.0 | 28.0 | 9.0 | 14.0 | cool_stuff | 2 | 72.19 | 20 | 1 | 15.625671 |
| 1 | 1 | 55.9 | 17.96 | 5 | 4.0 | 650.0 | 28.0 | 9.0 | 14.0 | cool_stuff | 1 | 73.86 | 20 | 1 | 27.505324 |
| 2 | 2 | 64.9 | 18.33 | 4 | 4.0 | 650.0 | 28.0 | 9.0 | 14.0 | cool_stuff | 2 | 83.23 | 20 | 1 | 19.565359 |
| 3 | 3 | 58.9 | 16.17 | 5 | 4.0 | 650.0 | 28.0 | 9.0 | 14.0 | cool_stuff | 3 | 75.07 | 20 | 1 | 23.223125 |
| 4 | 4 | 58.9 | 13.29 | 5 | 4.0 | 650.0 | 28.0 | 9.0 | 14.0 | cool_stuff | 4 | 72.19 | 20 | 1 | 21.091204 |
| 5 | 5 | 55.9 | 26.93 | 5 | 4.0 | 650.0 | 28.0 | 9.0 | 14.0 | cool_stuff | 1 | 82.83 | 20 | 1 | 27.366771 |
| 6 | 6 | 64.9 | 38.50 | 5 | 4.0 | 650.0 | 28.0 | 9.0 | 14.0 | cool_stuff | 1 | 103.40 | 20 | 1 | 24.124491 |
| 7 | 7 | 58.9 | 18.12 | 5 | 4.0 | 650.0 | 28.0 | 9.0 | 14.0 | cool_stuff | 1 | 153.75 | 20 | 1 | 31.292303 |
| 8 | 8 | 58.9 | 17.83 | 5 | 6.0 | 530.0 | 30.0 | 9.0 | 14.0 | cool_stuff | 1 | 153.75 | 20 | 1 | 31.292303 |
| 9 | 9 | 55.9 | 35.71 | 1 | 4.0 | 650.0 | 28.0 | 9.0 | 14.0 | cool_stuff | 1 | 20.00 | 20 | 1 | 30.484502 |
Last rows
| df_index | price | freight_value | review_score | product_photos_qty | product_weight_g | product_length_cm | product_height_cm | product_width_cm | product_category_name_english | payment_installments | payment_value | encodedCategory | TargetVar | Days_to_deliver | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 114974 | 116571 | 119.90 | 27.14 | 3 | 1.0 | 2400.0 | 20.0 | 30.0 | 30.0 | bed_bath_table | 5 | 147.04 | 7 | 1 | 32.241736 |
| 114975 | 116572 | 19.00 | 15.79 | 4 | 3.0 | 150.0 | 16.0 | 9.0 | 14.0 | toys | 1 | 69.58 | 69 | 1 | 21.114560 |
| 114976 | 116573 | 19.00 | 15.79 | 4 | 3.0 | 150.0 | 16.0 | 9.0 | 14.0 | toys | 1 | 69.58 | 69 | 1 | 21.114560 |
| 114977 | 116574 | 35.99 | 16.60 | 5 | 1.0 | 1850.0 | 20.0 | 20.0 | 20.0 | food_drink | 1 | 52.59 | 37 | 1 | 27.418333 |
| 114978 | 116575 | 146.90 | 15.20 | 1 | 2.0 | 350.0 | 18.0 | 15.0 | 16.0 | home_construction | 1 | 162.10 | 48 | 1 | 20.280139 |
| 114979 | 116576 | 129.90 | 51.20 | 5 | 1.0 | 6700.0 | 35.0 | 12.0 | 22.0 | garden_tools | 1 | 181.10 | 42 | 1 | 24.163831 |
| 114980 | 116577 | 99.00 | 13.52 | 4 | 1.0 | 2300.0 | 37.0 | 30.0 | 20.0 | furniture_decor | 2 | 112.52 | 39 | 1 | 4.582650 |
| 114981 | 116578 | 736.00 | 20.91 | 5 | 3.0 | 400.0 | 19.0 | 9.0 | 15.0 | watches_gifts | 1 | 756.91 | 70 | 1 | 24.296493 |
| 114982 | 116579 | 229.90 | 44.02 | 4 | 2.0 | 2700.0 | 60.0 | 15.0 | 15.0 | sports_leisure | 7 | 273.92 | 65 | 1 | 36.310336 |
| 114983 | 116580 | 43.00 | 12.79 | 5 | 1.0 | 600.0 | 30.0 | 3.0 | 19.0 | bed_bath_table | 1 | 55.79 | 7 | 1 | 18.291458 |